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(54) Method and system for estimating the robustness of algorithms for generating 
characterizing information descriptive of a selected text block 



(57) A method and system for selecting a character- 
ising algorithm to be used to characterize blocks of print- 
ed material. A digital image of printed material, such as 
an address block, on an object is obtained, and the im- 
age is processed to abstract characterizing information 
descriptive aspects of the other printed material. The 
characterizing information is combined with other infor- 
mation, such as postal information, and the combined 
information is then cryptograph ically authenticated with 
a digital signature or the like, An indicium representative 
of the authenticated information is then printed on the 
object. The object's relationship to the indicium can be 



ws 



verified by regenerating the characterizing information 
from the other printed material and comparing the re- 
generated characterizing information with characteriz- 
ing information recovered from the indicium. Thus cop- 
ies of the indicium cannot easily be used, without detec- 
tion, on other objects which do not include the other 
printed material, A particular algorithm is selected from 
a predetermined group of characterizing algorithms by 
determining an estimate for the robustness of each al- 
gorithm with respect to small changes in the image 
caused by the printing and scanning process or by de- 
facing of the printed address by blots or the like. 
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Description 

[0001] The subject invention relates to the problem of providing a robust, compact characterization of a block of 
printed text which will distinguish the selected block of text from other such blocks. More particularly, it relates to the 

5 problem of estimating the robustness of algorithms for generating characterizing information descriptive of printed 
material. (By "robust and compact" herein is meant information which is small enough in quantity to be incorporated 
into postal indicia yet will Identify a text block, and distinguish it from other text blocks, with sufficient reliability to deter 
"rubber stamp" counterfeiting; despite errors introduced by the printing and/or scanning processes.) 
[0002] Postage metering systems account for postage and other values such as parcel delivery service charges and 

10 tax stamps, and print indicia representative of such values as proof of payment. To protect against counterfeiting of 
Indicia, modern digital postage metering systems use encryption technology. The postage value and other information 
relating to an indicium are preferably digitally signed, or otherwise cryptographically authenticated, and the information 
and signature are incorporated into the digital postal indicium, 

[0003] Digital postal indicia using encryption technologies are extremely secure. In general, without knowledge of 

15 the proper encryption keys, it is essentially impossible to produce a counterfeit digital indicium, However, digital indicia 
are subject, as are all postal indicia, to "rubber-stamp" counterfeiting when a valid indicium is scanned and reproduced 
on multiple mail pieces. To prevent such "rubber-stamp" counterfeiting, it is known to incorporate information from the 
address block of the mail piece into the postal indicium. Because space on an envelope is limited, typically only a small 
portion of the information in the address block will be incorporated into the indicium. 

20 [0004] In Figure 1, prior art mailing system 10 includes address printer controller 12, address printer 14, postage 
meter 16, and Indicia printer 20, Address printer controller 12 receives address information from a data processing 
system (not shown), generates a bitmap representative of the nominal, or "pristine", image of the address block, and 
controls address printer 12 to print address block A, representative of the address, on envelope E. Meter 16 receives 
postage information, and other information, from the data processing system. Meter 16 also receives characterizing 

25 information descriptive of block A from address printer controller 12. The information received can be either text-based 
or image-based. Text-based information is descriptive of the words or characters making up to the address, (e.g., ASCII 
code) while image-based information is descriptive of the actual printed image in the address block. Meter 1 6 combines 
the characterizing information with the postage value and other information, typically digitally signs the combination, 
generates a bitmap representative of an indicium including the digitally signed combination, and controls indicia printer 

30 20 to print indicium IN on envelope E. When the mail piece is sent to a postal service location, the address block can 
be scanned again, and the information regenerated from the scanned address block compared to information recovered 
from indicium IN, without the need to communicate with the remote mailing system; thus tying indicium IN to the par- 
ticular mail piece. (Note that since indicium IN is cryptographically linked to the address on the mail piece, printer 20 
need not be a secure printer; but can be a general purpose printer which can be controlled by other devices for other 

35 uses.) Commonly assigned, U.S. patent application publication number 2004/0128254, entitled System And Method 
For Mail Destination Address information Encoding Protection And Recovery In Postal Payment in the name of Leon 
A. Pintsov /discloses a system similar to that of the Figure 1 using text-based characterizations of the address block. 
[0005] While useful for its intended purpose, problems remain with the system of Figure 1 and similar systems. It 
has proven difficult to reliably recover textual information from address blocks during the validation process using 

40 available optical character recognition (OCR) techniques. Attempts to increase the robustness of text-based systems 
by incorporation of additional information and/or the use of error correcting codes has resulted in undesirable increases 
in indicia size and computational complexity. Use of image-based characterizing systems, such as those described in 
the above mentioned publication number 2004/01 28254, has been proposed and is believed to substantially overcome 
some of the problems of text-based systems; however, it has proven difficult to form a priori estimates of the robustness 

45 of proposed characterizing algorithms for image-based system, forcing users to undertake extensive trial and error 
testing of various algorithms. This problem is exacerbated by the variation in robustness of characterizing algorithms 
with respect to the particular text block to be characterized. Thus, it is an object of the present invention to provide a 
method and system for estimating the robustness of characterizing algorithms with respect to a particular text block. 
[0006] The above object Is achieved and the disadvantages of the prior art are overcome in accordance with the 

50 subject invention by a method and system for selecting a characterizing algorithm for generating a characterizing 
information descriptor for a selected block of printed material when the printed material is to be scanned from an object 
and compared with the characterizing information descriptor at a location distant from where the block is printed. The 
system of the subject invention is controlled in accordance with the method of the subject invention to: print the block 
on an object; apply each algorithm from a predetermined set of characterizing algorithms to a pristine image of the 

55 block of printed material to generate a plurality of corresponding first characterizing information descriptors for the 
block; determine estimates of robustness, with respect to the block of printed material, for each of the algorithms in 
the set to determine which of the characterizing algorithms is most robust; and select a descriptor generated by the 
algorithm and being so determined to be most robust to be used at the distant location, 
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[0007] In accordance with one aspect of the subject invention, the estimates are determined by filtering the pristine 
digital image of the block of printed material with a print/scan filter to create a filtered image, the print/scan filter sim- 
ulating the expected transformation of the pristine image by printing and scanning processes; applying each algorithm 
from the predetermined set of characterizing algorithms to the filtered image to generate a plurality of corresponding 
5 second characterizing information descriptors for the filtered digital image; and, for each algorithm from the predeter- 
mined set of characterizing algorithms, comparing corresponding the first and the second descriptors to determine 
which of the characterizing algorithms is most robust. 

[0008] In accordance with another aspect of the subject invention the object is a mail piece and the block of printed 
material represents an address and the selected descriptor is comprised in an indicium printed on the mail piece; 
10 whereby the selected descriptor can be recovered from the indicium for use at the remote location. 

[0009] In accordance with another aspect of the subject invention the selected descriptor is one of the second de- 
scriptors. 

[0010] In accordance with yet another aspect of the subject invention the estimates are determined by filtering the 
pristine digital image of the block of printed material with a print/scan filter to create a filtered image, the print/scan 

15 filter simulating the expected transformation of the pristine image by printing and scanning processes; further filtering 
the filtered image with one or more defacing filters, the defacing filters simulating simulate blots, smudges, failure of 
print elements or scanner sensors, or other, similar occasional events which can not easily be incorporated into the 
print/scan filter to create one or more defaced images; applying each algorithm from the predetermined set of charac- 
terizing algorithms to the filtered image and to the one or more defaced images to generate a plurality of corresponding 

20 second characterizing information descriptors for the filtered digital image and one or more pluralities of defaced image 
descriptors corresponding to each of the one or more defaced images; and for each algorithm from the predetermined 
set of characterizing algorithms, comparing corresponding first characterizing information descriptors with correspond- 
ing second characterizing information descriptors and with each of the one or more corresponding defaced image 
descriptors to determine which of the characterizing algorithms is most robust 

25 [001 1] Other objects and advantages of the present invention will be apparent to those skilled in the art from con- 
sideration of the detailed description set forth below and the attached drawings. 

[0012] The present invention is illustrated by way of example, and not by way of limitation, in the figures of the 
accompanying drawings and in which like reference numerals refer to similar elements and in which: 
[001 3] Figure 1 shows a schematic block diagram of a prior art mailing system. 
30 [0014] Figure 2 shows a schematic block diagram of a mailing system in accordance with the subject invention. 
[0015] Figure 3 illustrates a method for abstracting characterizing information descriptive of an address block from 
an image of the address block. 

[001 6] Figure 4 illustrates another method for abstracting characterizing information descriptive of an address block 
from an image of the address block. 
35 [001 7] Figure 5 illustrates another method for abstracting characterizing information descriptive of an address block 
from an image of the address block. 

[0018] Figure 6 shows a flow diagram of the operation of a controller comprised in the system of Figure 2 in accord- 
ance with one embodiment of the subject invention. 

[0019] Figure 7 shows a flow diagram of the operation of the operation of a controller comprised in the system of 

40 Figure 2 in accordance with another embodiment of the subject invention. 

[0020] In Figure 2, mailing system 22 includes address printer controller 12, address printer 14, postage meter 16, 
and indicia printer 20, which are substantially similar to the corresponding prior art elements shown in Figure 1 . System 
22 differs in including data stores 24 and 26 communicating with controller 12. Data store 24 stores a plurality of 
characterizing algorithms, as will be described further below, and data store 26 stores at least a print/scan filter which, 

45 when applied to the pristine image, generates a filtered image which approximates the transformation of the pristine 
image by the printing and scanning processes. In other embodiments, data store 26 stores one or more defacing filters 
which simulate blots, smudges, failure of print elements or scanner sensors, or other, similar occasional events which 
can not easily be incorporated into said print/scan filter to create one or more defaced images, Together, meter 16, 
printer 20, form secure postal indicia printing system 22. 

50 [0021] Three methods for generation of image-based characterizing information, which are believed to provide im- 
proved compactness and robustness in accordance with the above object of the invention, have recently been devel- 
oped by the assignee of the present application and are described below as illustrative of the type of characterizing 
algorithms which can be used with the subject invention. Numerous other algorithms will be apparent to those skilled 
in the art and particular choices of algorithms to be used form no part of the subject invention, except as may be recited 

55 in the claims below and equivalents. Each of these methods is believed to provide a sufficiently high likelihood of 
detection to deter "rubber stamp" counterfeiting, particular by large scale matters, while having a sufficiently low rate 
of false positives that it will not unduly delay mail processing. It is believed that each of these methods will in general 
provide characterizing information which can be specified by a bit stream of approximately 6 to 12 bytes. 
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[0022] A characterizing algorithm in which the characterizing information comprises measurements of the lengths 
of the individual words which make up address A, is shown in Figure 3. Address block A is parsed to identify individual 
words by first identifying line spaces Is by determining the occurrence of large amounts of horizontal white space 
between blocks of printed text, and then identifying word spaces ws by determining the occurrence of large amounts 
5 of vertical white space between blocks of printed text (as shown with respect the first line of address A). Word lengths 
/1 through /9 are then determined for address A. Preferably, word lengths are taken (measured in pixels) from the 
edges of word spaces ws (or the address edges) as shown, but can be taken in any convenient manner, such as along 
the midline of the words. 

[0023] It is believed that using four or fewer bits per word would not be useful in postal applications. Thus, in a 
10 preferred embodiment the number of bits used can be selected to encode all words in the address, and two control 
bits will be sufficient to indicate selection of five to eight bits per word to encode the length of the word. In other 
embodiments, a fixed number of words in the address, for example the first eight, can be scanned at a fixed number 
of bits per word; eight in this case, since control bits would not be needed to specify the number of bits per word. 

15 Example 

[0024] An address such as shown in Figures 3-5 may produce, depending on the print font selected, etc., the 
following results using six bits per word: 



Word# 


1 


2 


3 


4 


5 


6 


7 


8 


9 


Length 
(pixels) 


173 


45 


150 


60 


154 


103 


168 


68 


189 



25 The absolute lengths are then normalized to the range 1 - 63, i.e. 2° - (2 6 -1), yielding: 



Word# 


1 


2 


3 


4 


5 


6 


7 


8 


9 


Length 
(normalized) 


56 


1 


46 


7 


48 


26 


54 


11 


63 



[0025] Another algorithm in which the characterizing information comprises measurements of the number of "outliers' 1 
in each word (or each line) that make up address A, is shown in Figure 4. (By "outliers" herein is meant ascenders or 
descenders and portions capitals of which project beyond thresholds, which are preferably determined by the upper 

35 and lower bounds of lower case letters without ascenders or descenders, such as "a", "c", "e", etc.) Address A is parsed 
to identify individual words, if necessary, by first identifying line spaces Is by determining the occurrence of large 
amounts of horizontal white space between blocks of printed text, and then identifying word spaces ws by determining 
the occurrence of large amounts of vertical white space between blocks of printed text (as shown with respect the first 
line of address A). Otherwise, only the lines need be identified. 

40 [0026] Assuming six bits are allocated per word, the number of upwards (+) and downwards (-) outliers per word can 
be encoded as "xxx/yyy" where x and y are binary digits and xxx is the number of (+) outliers and yyy is the number 
of (-) outliers. 

Example 

45 

[0027] Again taking eight bytes as the space allocated for the address block characterizing information, as shown 
in Figure 4 with respect to the first address line, (+) outliers 32, in word 1; 34, in word 2; and 36, in word 3 are identified 
as exceeding threshold 40, and outlier 42, in word 1, is identified as exceeding threshold 44. Since for address block 
A all of the outliers can be encoded in less than 60 bits, the resulting bit stream is: 

50 

1-O01/0O1-O01/000-10O/0QO-O10/0Q0-01 1/000-001/000-010/000-010/000-101/000-111 

I I I I I I I I I I I 

5 5 code word 1 word2 word3 word4 words word6 word7 word8 *ord9 end 

where code 1 indicates per word characterization and 111 is an end code. (The 111 end code of course implies that 
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no more than six (+) outliers can be recognized in any word, i.e. 110 means 6 or more.) 

[0028] Another algorithm in which the characterizing information comprises a description of the shape of the address 
block is shown in Figure 5. The shape is determined by using a conventional "best fit" scanning algorithm which encloses 
address block A with "best fit" closed curve 50. (It should be understood that various algorithms for generating a best 

5 fit curve will generate different curves. These differences do not affect the subject invention so long as the same 
algorithm is used to generate the curve whose description is incorporated into the indicium and to recover the curve 
from the address block when the indicium is validated,) Preferably, curve 50 is constrained. That is the manner in which 
a curve can be generated and is limited so that the resulting curve is simplified and can be described with limited 
information. In Figure 5, curve 50 is formed from linked straight line segments, such as segment 51, which are limited 

10 to eight "directions" , up (U), down (D), left (L), right I, up-right (UR), up-left (UL), down-right (DR), and down-left (DL); 
viewed as being generated starting in the upper left comer of address block A and traveling clockwise around address 
block A. Preferably, the curve 50 also accounts for spaces between characters, words and lines, treating these spaces 
as equivalent to printed space, so that curve 50 does not become too convoluted and require extensive descriptive 
information. It is within the skill of a person skilled in the art to provide an algorithm which will generate robust and 

*5 compact characterizing information, as described above. 

[0029] The characterizing information, i.e., the description of curve 50, can be encoded in a number of ways. In the 
present example, the characterizing information consists of only the directions, without lengths, of each successive 
line segment. 

20 Example 

[0030] Encoding line segment directions as: 

R = 000, L= 111, U =001 ,D= 110, UR=010, DL= 101, DR=011, UL = 100: and starting at the upper left of address 
block A, curve 50 is described by the bit stream: 

25 

000-01 1-000-010-01 1-000-001-000-1 1 0-000-001-000-1 10-1 1 1 -1 10-000-1 1 0- 

I I I l I I I I I I I I I i I I I 

RDRRURDRRURDRUR DLDRD 

111-110-111-001-111-001-110 

I I I I I I I 

L D L U L U D(end) 



40 Thus, curve 50 can be described in nine bytes, including an end code, which can be indicated by reversal (or repetition) 
of the immediately preceding segment direction. Again, this bit stream is incorporated into the indicium. 
[0031] Programming of a data processor to analyze scan data to perform imaging operations such as identifying 
lines and words, measuring the dimensions of letters and words or fitting a curve to an image in accordance with 
predetermined constraints are well known. Such operations are substantially routine in the character and general pat- 

45 tern recognition arts, for example. Techniques for carrying out such operations are also taught in Handbook of Pattern 
Recognition and Image Processing edited by T Young and K-S Fu, Academic Press, 1986, and need not be discussed 
further here for an understanding of the subject invention. 

[0032] Bit streams, such as those described above, comprise ordered sequences of values which are typically, though 
not necessarily, numeric values associated with words in the address block. (Such bit streams are hereinafter some- 

50 times "characterizing information descriptors" or "descriptors", and such values are hereinafter sometimes "character- 
izations".) As described above, when an indicium is validated, i.e., tied to the mail piece on which it is printed, at a 
distant postal facility the descriptor generated from the pristine image and incorporated into the indicium is compared 
with a descriptor recovered from an image scanned from the address block printed on the mail piece. It will be apparent 
to those skilled in the art that the recovered image will be transformed with respect to the pristine image by the char- 

55 acteristics of the printing and scanning processes, as well as possibly by the occurrence of occasional events such as 
blots. Thus, it is important that the algorithm used to characterize the address block be robust, that is, that it produce 
descriptors that match sufficiently when an indicium is valid, and do not match for invalid indicia, despite small differ- 
ences between the scanned image and the pristine image. It will also be apparent that the robustness of a particular 
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characterizing algorithm can vary for different address blocks. (As a hypothetical example, the above described algo- 
rithm based on word length may be less robust for address blocks printed in a small font while algorithms based on 
the number of outliers, or address block shape may be relatively insensitive to font size.) 

[0033] Figure 6 shows a flow diagram of the operation of controller 12 in accordance with one embodiment of the 
5 subject invention. At step 60 controller 1 2 obtains a pristine digital image, P, of address block A from a conventional 
source (not shown) such as a data processing system for preparing a bulk mailing. At step 62 controller 12 carries out 
printing of address block A in a conventional manner. Preferably, this printing process is carried out concurrently with 
the selection of a characterizing algorithm but, in other embodiments of the subject invention, printing of address block 
A can be carried out sequentially or by a separate processor. 
10 [0034] At step 64, controller 12 inputs a print/scan filter which simulates the printing process of printer 14 and the 
scanning process to be carried out at a remote postal facility from data store 26 and applies it to image P to generate 
a filtered image, F, which approximates the image which will be scanned from the mail piece at the postal facility. And 
at step 66 sets index i equal to 1 and variable R equal to 0. 

[0035] At step 70 controller 1 2 applies the ith characterizing algorithm C| to images P and F to generate corresponding 
15 descriptors C[(P) and Cj(F); each comprising a sequence of M characterizations, or values, CjfP^ through C,(P) M ; C, 
(F)i through 0^(F) M . Then at step 72, controller 12 compares descriptors Cj(P) and C,(F) to estimate a robustness value 
R, for the ith algorithm C,, with respect to a particular image P. 

[0036] The comparison at step 72 is carried out using a comparison algorithm ass ociated with characterizing algo- 
rithm Cj and which preferably is the same comparison algorithm used at the postal facility to compare the descriptor 

20 recovered from the scanned image with the descriptor incorporated into indicium IN. Preferably the comparison is 
carried out on a characterization by characterization basis, comparing each Cj(P)j with the corresponding C,(F)j to 
determine if the characterizations match; i.e. if they are "close enough" as defined by the particular comparison algo- 
rithm used. (As a hypothetical example, where the characterizations are word lengths they may be considered to 
"match" if the lengths differ by no more than one or two units; while if the characterizations are the number of outliers 

25 in a word a "match" may require exact equality.) 

[0037] In a preferred embodiment, once descriptors C,(P) and C,(F) have been compared, an estimate R ( for the 
robustness of algorithm C,, with respect to particular image P, is calculated as: 



30 R, = Total no. of [C,(P)j matching C,(F)j]/M (for j = 1 through M); 

where M is the number of characterizations generated by Cj. (Note that since robustness is defined with respect to 
small changes in the image, in normal use the filters, and the printing and scanning processes, will be such that the 
descriptors C|(P) and Cj(F) will have the same number of characterizations. Otherwise an error condition is generated.) 

35 [0038] Once estimate Rj is determined at step 74 controller 12 determines if Rj is greater than variable R and, if so, 
at step 78 controller 12 sets R =R| and index value I = i. Then, or immediately if Rj is not greater than R, at step 80 
controller 12 sets i = i+1. At step 82 controller 12 determines if i+1 is greater than N, the number of characterizing 
algorithms stored. If not, controller 12 returns to step 70 to test the next algorithm. Otherwise, at step 86 controller 12 
sends I and descriptor C,(P) to meter 16 in a conventional manner for incorporation into indicium IN. The postal facility 

40 can then recover I to identify C, and use C, to validate indicium IN in a conventional manner. In other embodiments, 
descriptors can be self-identified by their format, or, if a relatively small number of algorithms is used, the facility can 
sequentially test using all algorithms, with the assumption that only the algorithm actually used to generate the de- 
scriptor will give meaningful results; so that index value I need not be included in indicium IN. 
[0039] Figure 7 shows a flow diagram of the operation of controller 12 in accordance with another embodiment of 

45 the subject invention. Similarly to the above described embodiment, at step 90, controller 12 obtains pristine digital 
image, P, of address block A, at step 94 carries out printing of address block A concurrently with the selection of a 
characterizing algorithm and, at step 96 inputs a print/scan filter. 

[0040] At step 100, controller 12 inputs defacing filters D-| through D T (described above) and applies each of these 
filters to filtered image F to generate defaced images F'D^ through F*Dj which approximate scanned images of address 
50 blocks which have been defaced by occasional events such as blots. At step 102, controller 12 sets index i equal to 1 
and variable R equal to 0, 

[0041] At step 104 controller 12 applies the ith characterizing algorithm C, to images P, F and F*D 1 through F*Oj to 
generate corresponding descriptors C,(P), C,(F) and CjfF'Dj) through C,(F*D T ); each comprising a sequence of M 
characterizations, or values, C,(P) 1 through C,(P) M ; C j (F) 1 through C,(F) M , etc. Then at step 1 08, controller 1 2 compares 
55 descriptors C|(P) with descriptors C,(F) and 0^*0^) through C,(F*D T ) to estimate a robustness value R, for the ith 
algorithm Cj, with respect to a particular image R 

[0042] In a preferred embodiment, once descriptors Cj(P) and C,(F) have been compared an estimate R| for the 
robustness of algorithm C,, with respect to particular image P, is calculated as: 
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R, = Total no. of: [C,(P)j matching C,^ (for j = 1 through M) + 
C,(P)j matching C,(F*D k )j/M (for j = 1 through M, k =1 through T)]/M(T+1); 

5 

where M is the number of characterizations generated by Cj. 

[0043] Again, similar to the embodiment described above, once estimate R| is determined at step 110 controller 12 
determines if R, is greater than variable R and, if so, at step 112 controller 12 sets R =R ( and index value I = i. Then, 
or immediately if R, is not greater than R, at step 114 controller 12 sets i = i+1. At step 118 controller 12 determines if 
10 i+ 1 is greater than N, the number of characterizing algorithms stored, If not controller 1 2 returns to step 1 04 to test the 
next algorithm. Otherwise, at step 120 controller 12 sends I and descriptor C|(P) to meter 16 in a conventional manner 
for incorporation into indicium IN. The postal facility can then recover I to identify Cj and use C, to validate indicium IN 
in a conventional manner. 

[0044] In other embodiments, whether or not defacing filters are used, descriptor C,(F) can be incorporated into 
15 indicium IN. 

[0045] It is anticipated that other estimates for robustness of characterizing algorithms will be developed as experi- 
ence with different applications is gained or will be apparent to those skilled in the art. Accordingly it should be under- 
stood that, except for particular recitations in the claims below and equivalents thereof, details of particular estimates 
used form no part of the subject invention. 

20 

Claims 

1. A method for selecting a characterizing algorithm for generating a characterizing information descriptor for a se- 
25 lected block of printed material, where said printed material is to be scanned from an object and compared with 

said characterizing information descriptor at a location distant from where said block is printed, said method com- 
prising the steps of: 

a) printing said block on an object; 
30 b) applying each algorithm from a predetermined set of characterizing algorithms to a pristine image of said 

block of printed material to generate a plurality of corresponding first characterizing information descriptors 
for said block; 

c) determining estimates of robustness, with respect to said block of printed material, for each of said algorithms 
in said set to determine which of said characterizing algorithms is most robust; and 
35 d) selecting a descriptor generated by said algorithm and being so determined to be most robust be used at 

said distant location. 

2. A method as described in claim 1 wherein said step c) comprises the sub-steps of: 

40 d) filtering said pristine digital image of said block of printed material with a print/scan filter to create a filtered 

image, said print/scan filter simulating the expected transformation of said pristine image by printing and scan- 
ning processes; 

c2) applying each algorithm from said predetermined set of characterizing algorithms to said filtered image to 
generate a plurality of corresponding second characterizing information descriptors for said filtered digital 
45 image; and 

c3) for each algorithm from said predetermined set of characterizing algorithms, comparing corresponding 
said first and said second descriptors to determine which of said characterizing algorithms is most robust, 

3. A method as described in claim 2 where said object is a mail piece, and said block of printed material represents 
50 an address. 

4. A method as described in claim 3 where said selected descriptor is comprised in an indicium printed on said mail 
piece; whereby said selected descriptor can be recovered from said indicium for use at said remote location. 

55 5. A method as described in claim 4 where said indicium further comprises information identifying said algorithm so 
determined. 

6. A method as described in claim 2 where said selected descriptor is one of said second descriptors. 
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7. A method as described in claim 1 where said object is a mail piece and said block of printed material represents 
an address. 

8. A method as described in claim 7 where said selected descriptor is comprised in an indicium printed on said mail 
5 piece; whereby said selected descriptor can be recovered from said indicium for use at said remote location. 

9. A method as described in claim 8 where said indicium further comprises information identifying said algorithm so 
determined. 

10 10. A method as described in claim 1 wherein said step c) comprises the sub-steps of: 

c1) filtering said pristine digital image of said block of printed material with a print/scan filter to create a filtered 
image, said print/scan filter simulating the expected transformation of said pristine image by printing and scan- 
ning processes; 

15 c2) further filtering said filtered image with one or more defacing filters, said defacing filters simulating simulate 

blots, smudges, failure of print elements or scanner sensors, or other, similar occasional events which can not 
easily be incorporated into said print/scan filter to create one or more defaced images; 
c2) applying each algorithm from said predetermined set of characterizing algorithms to said filtered image 
and to said one or more defaced images to generate a plurality of corresponding second characterizing infor- 

20 mation descriptors for said filtered . digital image and one or more pluralities of defaced image descriptors 

corresponding to each of said one or more defaced images; and 

c3) for each algorithm from said predetermined set of characterizing algorithms, comparing corresponding 
first characterizing information descriptors with corresponding second characterizing information descriptors 
and with each of said one or more corresponding defaced image descriptors to determine which of said char- 
25 acterizing algorithms is most robust. 

11. A method as described in claim 10 where said object is a mail piece and said block of printed material represents 
an address. 

30 12. A method as described in claim 11 where said selected descriptor is comprised in an indicium printed on said mail 
piece; whereby said selected descriptor can be recovered from said indicium for use at said remote location. 

13. A method as described in claim 12 where said indicium further comprises information identifying said algorithm so 
determined. 

35 

14. A method as described in claim 10 where said selected descriptor is one of said second descriptors. 
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